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Abstract. Let F n be an n by n symmetric matrix whose entries are bounded by n 1 for 
some 7 > 0. Consider a randomly perturbed matrix M n = F n . + X n , where X n is a random 
symmetric matrix whose upper diagonal entries Xij, 1 < i < j, are iid copies of a random 
variable £. Under a very general assumption on £, we show that for any B > there 
exists A > such that P(a„(M n ) < n~ A ) < rC B . The proof uses an inverse-type result 
concerning the concentration of quadratic forms, which is of interest of its own. 



1. INTRODUCTION 

Let F n be an n by n matrix whose entries are bounded by n°^\ Consider a randomly 
perturbed matrix M n = F n + X n , where X n is a random matrix whose entries are iid copies 
of a random variable. It has been shown, under a very general assumption on £, that the 
singular value of M n cannot be too small. 

Theorem 1.1. [30} Theorem 2.1] Assume that M n = F n + X n , where the entries of F n are 
bounded by n 7 , and the entries of X n are iid copies of a random variable of zero mean and 
unit variance. Then for any B > 0, there exists A > such that 



PK(M n ) < n- A ) < n~ B . 
Here a n {M n ) is the smallest singular value of M n , defined as 



a n (M n ) := inf ||M n x[|. 

||x|| = l 

The dependence among the parameters in Theorem 1 1 . 1 1 were made explicitly in [32]. Under 
the stronger assumption that £ has sub-Gaussian distribution, Rudelson and Vershynin [2T] 
obtained an almost best possible estimate on the tail bound of o~ n (M n ). For more results 
regarding this random matrix ensemble we refer the reader to |2H [5D] [52] . 

One important application of Theorem II. II is a polynomial bound for the condition number 
of M n . 

Corollary 1.2. |30} Corollary 2.10] With the same assumption as in Theorem \l.ll for any 

B > 0, there exists A > such that 
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P((Ti(M n )/<r n (M n ) < n A ) > 1 - n~ B 

The condition number k(M) = u\ (M) /cr n (M) of a matrix M plays a crucial role in numeri- 
cal linear algebra. The above corollary implies that if one perturbs a fixed matrix F of small 
spectral norm by a (very general) random matrix X n , the condition number of the resulting 
matrix will be relatively small with high probability. This fact has some nice applications 
in theoretical computer science. (See for instance |23t 12^] for further discussions on these 
applications). 

Another popular model of random matrices is that of random symmetric matrices; this is 
one of the simplest models that has non-trivial correlations between the matrix entries. A 
significant new difficulty in the study of the singularity of X n (or of M n in general) is that 
the symmetry ensures that det(JT n ) is a quadratic function of each row, as opposed to the 
regular random ensembles in which det(X„) is a linear function of each row. 

Answering an old question of Weiss, a recent result of Costello, Tao and Vu [3] shows that 
X n is almost non-singular with high probability. 

Theorem 1.3. O Theorem 1.3] Assume that the upper diagonal entries Xij of X n are iid 
Bernoulli random variables. Then X n is non-singular with probability 1 — n~ 1 / 8+ °( 1 \ 

In [T7], the current author improved the bound to any polynomial order. 

Theorem 1.4. |171 Theorem 1.2] Assume that the upper diagonal entries of X n are iid 
Bernoulli random variables. Then X n is non-singular with probability 1 — 0{n~ B ) for any 
B>0. 

The goal of this note is to study the smallest singular value of randomly perturbed matrices 
M n under general assumptions on £. 

Condition 1 (Anti-concentration). Assume that £ has zero mean, unit variance, and there 
exist positive constants c\ < c 2 and C3 such that 

p(ci < <c 2 ) >c 3 , 

where £' is an independent copy o/£. 

Note that the last condition is more general than the assumption of boundedness of (2 + rf)- 
moment, for some r\ > 0. 

Theorem 1.5 (Main theorem). Assume that the upper diagonal entries of X n are iid 
copies of a random variable £ satisfying Condition [7J Assume also that the entries fij of 
the symmetric matrix F n satisfy \fij\ < n 1 for some 7 > 0. Then for any B > 0, there 
exists A > such that 

P(a n {M n ) < n~ A ) < n~ B . 

Remark 1.6. By technical reasons, we assume that F n is symmetric, although we believe 
that the above results hold without this assumption. 
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Our result implies a polynomial bound for the condition number of M n , assuming that the 
spectral norm of X n is bounded (which is the case for most of matrix ensembles). 

Corollary 1.7. With the same assumptions as of Theorem \1.5\ and assume that \\X n \\ = 
n O(!). Then for any B > 0, there exists A > such that 

P(k(M„) > n A ) < n' B . 

As another application, we show that the determinant of random symmetric matrices is 
concentrated around its mean with high probability. 

Corollary 1.8. Assume that the upper diagonal entries Xij of X n are iid copies of a random 
variable £ of zero mean, unit variance, and there is a constant C > such that P(|£| < 
C) = 1. Assume furthermore that the entries of the symmetric matrix F n also satisfy 
\fij\ < C. Let B > be any constant. Then with probability 1 — 0(n~ B ), 

| det(M n )| = expi-Oin 1 / 3 logn))E(| det(M n )|), 
where the implied constants depend on B and C . 

This corollary refines an important case of |3H Theorem 34] obtained by Tao and Vu by a 
different method, which in turn compensates previously known results on the concentration 
of the determinant of non-symmetric random matrices (cf. [H (H [27] ) . 

Notation. For a matrix M we use the notations rj(M) and Cj(M) to denote its i-th row 
vector and its j-th column vector respectively; we use the notation (M)ij to denote its ij 
entry. 

As usual, we use rj to denote random Bernoulli variables (thus rj takes values ±1 with 
probability 1/2). For a given < \i < 1, we use rf$ to denote random Bernoulli variables 
of parameter /i (thus rf 1 *' takes values ±1 with probability [i/2 and zero with probability 

i-m)- 

Here and later, asymptotic notations such as 0,Q,Q, and so for, are used under the as- 
sumption that n — > oo. A notation such as Oc(-) emphasizes that the hidden constant in 
O depends on C. If a = 0(6), we write 6 < a or a > 6. If a = Q(b) and b = f2(a), we write 
a x b. 



2. The approach to prove Theorem 11.51 

For the sake of simplicity, we will prove our result under the following condition. 
Condition 2. With probability one, 



< n 



B+l 



for all 
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In fact, because £ has unit variance, we have 

Pflsijl >n B+l )=0{n- 2B - 2 ). 

Thus, we can assume that \xij\ < n B+l at the cost of an additional negligible term o(n~ B ) 
in probability. 

We next assume that a n (M n ) < n~ A . Thus 

M n x = y, 

for some ||x|| = 1 and ||y|| < n~ A . There are two cases to consider. 
Case 1. det(M n ) = 0. In this case £ has discrete distribution. 

We first show that it is enough to consider the case of M n having rank n — 1, thanks to the 
following result. 

Lemma 2.1. For any 1 < k < n — 2, 

P(rank(M n ) = k < n - 2) < fi Cl (l)P(rank(M 2n _ fc _i) = 2n - k - 2). 

We deduce Lemma EO from a useful observation by Odlyzko, whose simple proof is presented 
in Appendix lAl 

Lemma 2.2 (Odlyzko's lemma^lQj). Let H be a linear subspace in R n of dimension at 
most k < n. Then 

P(u G H) < (VT^~3) n ~ k , 
where u = (/i + x\, . . . , f n + x n ), fi are fixed and Xi are iid copies of £. 

Proof, (of Lemma l2.ip View M n+ \ as the matrix obtained by adding the first row and first 
column to M n . Let H be the vector space of dimension k spanned by the row vectors of 
M n . Then the probability that the subvector formed by the last n components of the first 
row of M n+ i does not belong to H, by Lemma I2T21 is at least 1 — {^/Y^c^) n ^ k . 

Hence, 

P(rank(M n+ i) = k + 2|rank(M n ) = k) > 1 - (^/T^) n - k . 
In general, for 1 < t < n — k we have 



P(rank(M n+i ) = k + 2t|rank(M n+ i_i) = k + 2(t - 1)) > 1 - (v 7 !^) 



n-t-k+l 



Because the rows (and columns) added to M n+t -\ each step (to create M n+t ) are indepen- 
dent, we have 
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P(rank(M 2n _fc_i) =2n — k- 2|rank(M n ) = k) > 

n—k—l 

> Yl P(rank(M n+i ) = k + 2t|rank(M n+ t_i) = k + 2(t— 1)) 
t=i 

> (i - (vT=^)"- fc )(i - (VT^T-*- 1 ) •••(!- (VT^s)) = n C3 (i). 

□ 

Next we show that in the case of M n having rank n — 1, it suffices to assume that 
rank(M„_i) > n — 2, thanks to the following simple observation. 

Lemma 2.3. Assume that M n has rank n — 1. Then there exists 1 < i < n such that the 
removal of the i-th row and the i-column of M n results in a symmetric matrix M n _\ of rank 
at least n — 2. 

Proof, (of Lemma 12, 3p With out loss of generality, assume that the last n — 1 rows of M n 
span a subspace of dimension n — 1. Then the matrix obtained from M n by removing the 
first row and the first column has rank at least n — 2. □ 

Without loss of generality, we assume that the matrix M n _i obtained from M n by removing 
its first row and first column has rank at least n — 2. We next express det(M n ) as a quadratic 
function of its first row (mn, . . . , m\ n ) as follows. 

det(M n ) = cuiM^md + ^ c ij (M n ^ 1 )mi i mi j 

2<i,j<n 

where cn(M n ) is the first cofactor of M n , while Cjj(M„_i) are the corresponding cofactors 
of the matrix M n _i. 

It is crucial to note that, since M n _i has rank at least n — 2, at least one of the cofactor 
Cij(M n -i) is nonzero. Set c := (J2 2 <i,j<n c iji M n-i) 2 ) 1/2 and := Qj(M n _i)/c. 

Roughly speaking, our approach consists of two main steps. 

• Step 1. Assume that 

Px 11 ,...,x ln ((c u (M n )/c)ml 1 + ^ a^miimy = 0|M„_ X ) > 

2<ij'<n 

then there is a strong additive structure among the cofactors Cjj(M n _i) of M n _\. 

• Step 2. The probability, with respect to M n _i, that there is a strong additive 
structure among the Cij{M n _\) is negligible. 
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We will execute Step 1 by proving Theorem 12.51 below (as a special case). Step 2 will be 
carried out by proving Theorem 12.61 

Case 2. det(M n ) ^ 0. Let C(M n ) = (cij(M n )) be the matrix of the cofactors of M n , we 
have 

C(M n )y = det(M n ) • x. 

Thus ||C(M n )y|| = |det(M n )|. 

We infer that, without loss of generality, 

\cu(M n ) yi + ... cx n {M n )y n \ > | det(M n )|/n 1 / 2 . 
Note that ||y|| < n~ A , thus 

n 

^|ci J (M n )| 2 >n 2A - 1 det(M n ) 2 . (1) 
i=i 

For j > 2, we write 



cij(M n ) = y~]mnCij(M n -i), 

i=2 

where M n _i is the matrix obtained from M n by removing its first row and first column, 
and Qj(M n _i) are the corresponding cofactors of M n _\. 

Hence, by the Cauchy-Schwarz inequality, by Condition [21 and by the bounds < n 1 for 
the entries of F n , we have 



cij (M„) 2 < E™=2 ml Er=2 4( M »-0 

<^^4-(W (2) 

Similarly, for j = 1 we write 



c u (M n ) = y~]mi 2 Ci2(M n -i). 

i=2 



Thus, 
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n 

di(M n ) 2 < n^+^cKMn-!). (3) 
It follows from (P),© and © that 

£ Cjj(^n-i) 2 > n 2 ^ 2B - 2 ^ 4 det(M„) 2 . 

2<i,j<" 

Hence, for proving Theorem 11.51 it suffices to justify the following result. 
Theorem 2.4. For any B > 0, there exists A > such that 

P(( c i ,(M n _ 1 ) 2 ) 1 /2> n ^| det ( Mn )|)<n- i? . 

2<M<" 

To prove Theorem 12.4^ we again express det(M n ) as a quadratic form of its first row. 
det(M„) = cn(M ri )m 2 1 + Cy (M„_i)miim.,i. 

2<ij<n 

In other words, 

det(M n )/c = mndi/c+ a^murrixj, 

2<i,j<n 

where c := (E2<ij<n c *i ( M «-i) 2 ) 1/2 and °*j := ^(Mi-l)/c. 

Roughly speaking, our approach in this case also consists of two main steps. 

• Step 1. Assume that 

'Px l i,...,x ln {\{cii(M n }/c)ml 1 + aij-mumijl < n~ A \M n -i) > n~ B , 

2<i,j<n 

then there is a strong additive structure among the cofactors Cj,-. 

• S'tep £ The probability, with respect to M n _i, that there is a strong additive 
structure among the Cj,- is negligible. 

We now state our main supporting lemmas. 

Theorem 2.5 (Step 1). Let < e < 1 be given constant. Assume that Ylij a lj = 1> where 
ay = dji, and 
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supPz 2 ,...,x n (| ^2 ai i( Xi + + fi) ~ a \ - n A )^ n B 

a 2<ij<ra 

for some sufficiently large integer A, where Xi are iid copies of £. Then there exists a vector 
u = (ui, . . . , ii n _i) satisfying the following properties. 

• ||u|| x 1 and |(u, ri (M n _i))| < n -A/2+o B .^i) f rn-0 B ,e{X) row s of M n -\. 

• There exists a generalized arithmetic progression Q of rank Ob,e{^-) o/nd size n° B ' € ^ 
that contains at least n — 2n e components Ui . 

• All the components ui, and all the generators of the generalized arithmetic progres- 
sion are rational numbers of the form p/q, where \p\, \q\ < n A / 2+ ° B '^ l \ 



We refer the reader to Section [3] for a definition of generalized arithmetic progression. 
Theorem 12.51 which is at the heart of the paper, follows from our study of the inverse 
Littlewood-Offord problem for quadratic forms (Sections [3][6]). 

We next proceed to the second step of the approach showing that the probability for M n _i 
having the above properties is negligible. 

Theorem 2.6 (Step 2). With respect to M n —\, the probability that there exists a vector u 
as in Theorem 1 2. 51 is exp(— Q(n)). 

The rest of the paper is organized as follows. We state and prove the inverse Littlewood- 
Offord result for quadratic forms throughout Sections [3][6j As an application, we prove 
Theorem 12.51 in Section [7] and conclude Theorem 12.61 in Section [HJ We will prove Corollary 
Olin Section [3 

3. The inverse Littlewood-Offord result for quadratic forms 

A classical result of Erdos [6j and Littlewood-Offord [16] asserts that if aj are real numbers 
of magnitude |aj| > 1, then the probability that the random sum Y17=i concentrates 
on an interval of length one is of order 0(n~ 1 ^ 2 ), where Xi are iid copies of a Bernoulli 
random variable. This remarkable inequality has generated an impressive way of research, 
particularly from the early 1960s to the late 1980s. We refer the reader to El [Til E21 1131 
HH [151 EH E21 [251 S3] for further reading regarding these developments. 

Motivated by inverse theorems from additive combinatorics (see |33[ Chapter 5]), Tao and 
Vu brought a new view to the problem: find the underlying reason as to why the concen- 
tration probability of Y^7=l aiXi on a sri ort interval is large. 

Typical examples of that have large concentration probability are generalized arithmetic 
progressions (GAPs). 
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A set Q is a GAP of rank r if it can be expressed as in the form 

Q = {9o + kigi + • • • + k r g r \Ki < ki < K[ for all 1 < i < r} 
for some {g , . . . ,g r },{Ki, . . . ,K r },{K[, . . .,K' r }. 

It is convenient to think of Q as the image of an integer box B := {(k\, . . . , k r ) £ TI'\Ki < 
ki < K'j} under the linear map 

$ : (fci, . . . , k r ) i->- g + hgi H h Avg,,. 

The numbers gi are the generators of P, the numbers K[ and i£j are the dimensions of P, 
and Vol(<5) := \B\ is the volume of P. We say that Q is proper if this map is one to one, 
or equivalently if \Q\ = Vol(Q). For non-proper GAPs, we of course have \Q\ < Vol(Q). If 
— Ki = K[ for all i > 1 and go = 0, we say that Q is symmetric. 

A closer look at the definition of GAPs reveals that if are very near to the elements of 
a GAP of rank O(l) and size n olyl \ then the probability that Y17=i a i Xi concentrates on a 
short interval is of order n~°^ , where Xi are iid copies of a Bernoulli random variable. 

It was shown by Tao and Vu [26j [30] [32] , in an implicit way, that these are essentially the 
only examples that have high concentration probability. An explicit and optimal version 
has been given in a recent paper by the current author and Vu. 

We say that a is <5-close to a set Q if there exists q E Q such that \a — q\ < 5. 

Theorem 3.1 (Inverse Littlewood-Offord theorem for linear forms, [18 J. Let < e < 1 

and B > 0. Let (3 > be a parameter that may depend on n. Suppose that not all ai are 
zero, and 



n 

supP x (| ^a^Xi + fi) - a\< ft) = p> n~ B , 

a . 

i=l 

where x = (x\, . . . ,x n ), and Xi are iid copies of a random variable £ satisfying Condition 
[Jl Then the following holds. For any number n' between n e and n, there exists a proper 
symmetric GAP Q = {^I=i x i9i '■ \ x i\ ^ ^i} such that 

• At least n — n' elements of ai are /3-close to Q. 

• Q has small rank, r = Ob,^), and small cardinality 

\Q\ <max(o B , e (^i),lj . 

• There is a non-zero integer p = Ob i£ (Vji') such that all steps gi of Q have the form 

In this and all subsequent theorems, the hidden constants could also depend on ci, 02,03 
of Condition [TJ We could have written Cl)C3jC3 (.) everywhere, but these notations are 
somewhat cumbersome, and this dependence is not our focus, so we omit them. 
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Theorem 13.11 was proven in [18J with c\ = 1, oi = 2 and C3 = 1/2, but the proof there 
automatically extends to any constants < c\ < C2 and < C3. 

An immediate corollary of Theorem 13.11 is the /3— net Theorem by Tao and Vu [30], which 
plays a crucial rule in their resolution of the circular law conjecture in random matrix theory. 

To prove Theorem 12.51 we need an inverse-type result for the high concentration of the 
quadratic form Y^i a ij( x i + fi)( x j + /?')• A classical approach is to pass the problem to the 
corresponding bilinear form. 

Theorem 3.2 (Inverse Littlewood-Offord theorem for bilinear forms). Let < e < 1 and 

B > 0. Let f3 > be a parameter that may depend on n. Suppose that Ylij a ij = 1 an d 

supP x , y (| V] aij(xi + fi)(yj + fj) - a\ < (3) = p > n~ B , 

a . ., 

where x = (x±, . . . , x n ), y = (yi, . . . , y n ), and Xi and yi are iid copies of a random variable 
£ satisfying Condition^ and fi are fixed. Then there exist a set Lq of size Ob, £ (1), a set L 
of size at least n — 2n e , and integers 7^ k, ka ,io G Lo,i G /, all bounded by n° B ^^ such 
that the following holds. Let R be the matrix defined as 





(k- 


if j g -fo; 




k, 


ifj = i; 






otherwise 



for each i G /; and the other entries kij of R are zero, except the diagonal terms k„,i ^ I 
which are defined to be one. Then the following holds for any i-th row, i G /, of the matrix 
A' = RA, 

P y (\(y + f, ri (A')}\ </3n°^«) > rr°^« (5) 

where f = (/i,...,/ n ). 

One may apply Theorem 13.11 to (0) to conclude that most of the components of ^(^4') are 
very close to a GAP of rank 0(1) and n oi - l \ However, we prefer to keep ([5]) as it is more 
convenient to use. 

Roughly speaking, Theorem 13.21 states that if sup a P x ,y(| Yli j<n a ij( x i + fi)(,Uj + fj) — a l < 
> n~ B , then there exist r = |/o| = 0(1) rows Ti 1 (A), . . . ,ri r (A) of A such that most of 
the remaining rows of A can be written as in the form k\Ti l (A) + • • • + k r Ti r (A) + a, where 
\ki\ = n°( l \ and most of the components of a are very close to a GAP of rank O(l) and 
size n olyl \ 
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We now state our main inverse-type result for high concentration probability of quadratic 
forms. 

Theorem 3.3 (Inverse Littlewood-Offord theorem for quadratic forms). Let < e < 1 and 

B > 0. Let /3 > be a parameter that may depend on n. Suppose that X^=i a ij = 1> w here 

a ij = a ji> an d 

SUpP x (| ^ a ij( x i + fi)( x j + fj) ~ a \ < P) = P > n ~ B - 
a . 

Then there exists a matrix R as in Theorem VS. "A such that for any i £ L , 

P z (|(z,r i (A / ))| < Pn° B >'V>) > n"° s - (1) , 

where z = (z\, . . . , z n ) and z% are iid copies o/r/^ 1//2 )(^ — £,'), where is a Bernoulli 

random variable of parameter 1/2 independent of £ and 

Remark 3.4. All the results presented in this section are in continuous setting. Similar 
results in discrete setting were obtained in [T7] . The reader may also consult that paper for 
more examples and motivations. 

4. A RANK REDUCTION ARGUMENT AND THE FULL RANK ASSUMPTION 

This section, which is independent of its own, provides a technical lemma we will need for 
later sections. Informally, it says that if we can find a proper symmetric GAP that contains 
a given set, then we can assume this containment is non-degenerate. 

Assume that P = {A^gi + • • • + k r g r \ — Ki < k{ < Ki} is a proper symmetric GAP, which 
contains a set U = {u±, . . . .u n }. 

We consider P together with the map $ : P — > R r which maps k\g\ + • • • + k r g r to 
(ki, . . . , k r ). Because P is proper, this map is bijective. 

We know that P contains U, but we do not know yet that U is non-degenerate in P in the 
sense that the set has full rank in R r . In the later case, we say U spans P. 

Theorem 4.1. Assume that U is a subset of a proper symmetric GAP P of size r, then 
there exists a proper symmetric GAP Q that contains U such that the followings hold. 

• rank(Q) < r and \Q\ < O r {l)\P\. 

• U spans Q, that is, 4>{U) has full rank in R, rank (Q), 

To prove Theorem 14.11 we will rely on the following lemma. 

Lemma 4.2 (Progressions lie inside proper progressions). There is an absolute constant C 
such that the following holds. Let P be a GAP of rank r in R. Then there is a symmetric 
proper GAP Q of rank at most r containng P and 
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\Q\ <r Cr \P\. 

Proof, (of Theorem 14, 1|) We shall mainly follow Section 8] . 

Suppose that Q(U) does not have full rank, then it is contained in a hyperplane of R r . In 
other words, there exist integers a\, . . . , a r whose common divisor is one and a±k\ + ■ ■ ■ + 
a r k r = for all (ki,...,k r ) £ $(U). 

Without loss of generality, we assume that a r ^ 0. We select w so that g r = a r w, and 
consider P 1 be the GAP generated by g[ := gi — aiW for 1 < i < r — 1. The new symmetric 
GAP P' will continue to contain U, because we have 

hg[ H h fc r -l5r-l = k l9l H r- ^r^r - w(ai^i H h a r g r ) 

= hgi H h A; r 5 r 

for all (fcx, , Av) G ^(t 7 )- 

Also, note that the volume of P' is 2 r ~ 1 K\ . . . K r —\, which is less than the volume of P. 

We next use Lemma 14.21 to guarantee that P' is symmetric and proper without increasing 
the rank. 

Iterate the process if needed. Because the rank of the newly obtained proper symmetric 
GAP decreases strictly after each step, the process must terminate after at most r steps. 

□ 



5. proof of Theorem 13.21 

For minor technical reasons, it is convenient to assume £ to be a discrete random variable. 
The continuous case of the theorem can be recovered from the discrete one by a standard 
argument (approximating the continuous distribution by a discrete one while holding n 
fixed). 

For short, we denote the vector (an, . . . , aj n ) by aj. We begin by applying Theorem 13.11 
Lemma 5.1. Let e < 1, and B be positive constants. Assume that 

p = sup P x , y ( | y^OyOi + fi)(yj + fj) - a\ < P) > rT B . 

a . . 

hJ 

Then the following holds with probability at least 3p/4 with respect to y = (y\, . . . , y n ). 
There exist a proper symmetric GAP Q y of rank Os,e(^) and size m&x(C'B t t(p~ 1 /n 6 / 2 ), 1) 
and a set I y of n — n e indices such that (aj,y) is f3-close to Q y for each i G I y . 
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Proof, (of Lemma 15. ip For short we write 

n 

^aijixi + fi)(yj + fj) = J2( Xi + /*)( a *' (y + f ))- 

i,j i=l 
We say that a vector y = (yi, . . . , y n ) is good if 



p x (i + (y + f )) - o| < /3) > /V4. 

i=l 

We call y 6ad otherwise. 

Let G denote the collection of good vectors. We are going to estimate the probability p of 
a randomly chosen vector y = (yi, ■ ■ ■ , y n ) being bad by an averaging method. 



n 

P y P x (| ^2(x t + fi)(ai, (y + f))-a\<P)=p 

pp/A + 1 - p > p 
(l-p)/(l-p/4)>p. 

Thus, the probability of a randomly chosen y belonging to G is at least 

l-p>(3p/4)/(l-p/4)>3p/4. 
Consider a good vector y € G. By definition, we have 



Px(| J>i + (y + f)) - a| < /3) > p/4. 

i=l 

Thus 



supP x (| (y + f)}-6| <P) >P/ 4 - 

6 i=i 

Next, if (aj, (y + f)) =0 for all i, then the conclusion of the theorem holds trivially. 
Otherwise, we apply Theorem 13.11 to the sequence (a$, (y + f )), i = 1, . . . , n, obtaining a 
proper symmetric GAP Q y of rank Ob )£ (1) and size max(OB,e(p~ 1 /n e / 2 ), 1), together with 
its elements %(y) such that |(aj,y) — %(y)| < /3 for all « € J y , a set of at least n — n € . □ 
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We now work with the cfo(y). 
Recall that 
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By Theorem 14. 1\ we may assume that the qi (y ) span Q y . 



P y (y eG)> 3 P /4. (6) 

Next, for each y G G, we choose from I y s indices i yi , . . . , i Vs such that q^ y . (y) span Q y , 
where s is the rank of Q y . We note that s = 0^(1) for all s. 

Consider the tuples (i yi , . . . ,i Vs ) for all y G G. Because there are ^ s Ob )£ (ji s ) = n° B >^ 
possibilities these tuples can take, there exists a tuple, say (1, ... , r) (by rearranging the 
rows of A if needed) such that (i yi ,i Vs ) = (1, . . . , r) for all y 6 G', and a subset G" of 
G satisfying 

P y (y G G') > P y (y G G)/n° c ' e ( r> = pjn *^. (7) 
For each 1 < i < r, we express ^i(y) in terms of the generators of Q y for each y G G', 

%(y) = ca(y)#i(y) H h Cj r (y)# r (y), 

where Cji(y), • • • Cj r (y) are integers bounded by n - 8 -^ 1 ), and 5i(y) are the generators of Q y . 
We will show that there are many y that correspond to the same coefficients Cij. 



Consider the collection of the coefficient-tuples ^(cn(y), . . . , ci r (y)) ; . . . ; (c r i(y), . . . c rr (y) 
for all y G G' . Because the number of possibilities these tuples can take is at most 



There exists a coefficient-tuple, say ^(cn, . . . , ci r ), . . . , (c r i, . . . c rr )^ , such that 

((cn(y), • • . ,c ir (y)); . . . ; (c ri (y),. . . c rr (y))) = ((cu,. . . ,c ir ),. . . , (c r i, . . . c rr )) 
for all y G G", a subset of G' satisfying 

P y (y e G") > P y (y G G')/n° B ^ > pjn *^. (8) 

In summary, there exist r tuples (en, . . . , ci r ), . . . , (c r i, . . . c rr ), whose components are in- 
tegers bounded by n° B ' t<yl \ such that the followings hold for all y G G" . 

• <?i(y) = c».i5i(y) H H c jr g r (y), for i = 1, . . . , r. 
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• The vectors (en , . . . ,ci r ), . . . , {c T \ , . . . c rr ) span Z rank ( ( 5y) . 

Next, because |/ y | >n — n e for each y G G", by an averaging argument, there is a set / of 
size at least n — 2rf such that for each i G / we have 

P y (»e/ y jeG")>P y (yeG")/2. (9) 

Fix an arbitrary row a of index from /. We concentrate on those y G G" where the index 
of a belongs to I y . 

Because q(y) G Q y (q(y) is the element from Q y that is /3-close to (a, y)), we can write 

<?(y) = ci(y)5i(y) + . . . c r (y)5f r (y) 
where c,(y) are integers bounded by n° B > e ^\ 

For short, for each i we denote by Vj the vector (en, . . . ,Q r ), we will also denote by v a>y 
the vector (ci(y), . . . c r (y)). 

Because Q y is spanned by </i(y), • • • , q r (y), we have k = det(vi, . . . v r ) ^ 0, and that 

kq(y) + det(v &iy , v 2 , . . . , v r )gi(y) H h det(v aiy , vi, . . . , v r _i)g r (y) = 0. 

Next, because each coefficient of the identity above is bounded by n° B -^ l \ there exists a 
subset G a of G" such that all y G G a correspond to the same identity, and 

P y (y G G a ) > (P y (y G G")/2)/(n°^( 1 ))'- = /Vn°^ W . (10) 

In other words, there exist integers k\,...,k r depending on a, all bounded by n° B ^ l \ such 
that 

kq{y) + fcitfi(y) + • • • + k r q r {y) = (11) 

for all y G G a . 

Note that k is independent of a and y. It is crucial to note that, since qi(y) is /3-close to 
(a;, (y + f)>. It follows from CTI]) that for all y G G a 

|fc(a, (y + f)) + fci(ai, (y + f)) + • • • + Ma r , (y + f))| < n° B ' e(1) /3. (12) 



Next we introduce the following matrix. 
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Definition 5.2 (Row matrix R). For each i £ I, consider a = Yi(A) and those k±, . . . , k r 
from (|12p . We define the i-th row of R as 



ri(R) := (A;i,...,A; r ,0,... 5 0,A;,0,...,0). (13) 

The other entries of R are zero, except the diagonal terms (R)u, where i £ I, which are set 
to be one. 



It is clear from the definition of R that 



Let ^' := RA. It follows from flTZD that 



PyQ{y + f,Ti(A'))\ < (3n° B ^)> p/n° B ^ = n~° B ^ (15) 

for any i £ I, hence obtaining the conclusion of the theorem for the case £ having discrete 
distribution. 

To recover the continuous case, we approximate £ by a discrete distribution while holding 
f3,n,f- and aij fixed. By taking the limit, we again obtain (|15|) for some R. 



6. proof of Theorem 13.31 



In this section we will use the results from Section [5] to prove Theorem 13.31 

Let U be a random subset of {1, . . . ,n}, where P(i £ U) = 1/2 for each i. Let Ajj be a 
matrix of size n by n defined by 




aij if either i G U and j ^ U, or z ^ C7 and j € J7, 
otherwise. 



We first apply the following lemma. 

Lemma 6.1 (Concentration for bilinear forms controls concentration for quadratic forms). 
Let U be given. Assume that 



swpP x (\} j a i j(xi + fi)(xj + fj) - a\ < P) = p>n 

a . . 

where x = (xi, . . . , x n ) and Xi are iid copies of £. 
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Then 

Pv,w(| MifiviWjl = Ofl(/3\/iog")) > P 8 /((2vr) 7/2 exp(4vr)), (16) 

where v = (vi, . . . ,v n ), w = (u>i, . . . ,w n ), and Vi,Wj are iid copies of £ — 
We refer the reader to Appendix [B] for a proof of this lemma. 

By the definition of £, it is clear that the random variable £ — £' also satisfies Condition 
[1] (with different positive parameters). We next apply Theorem 13.21 to f)16|) to obtain the 
following lemma. 

Lemma 6.2. There exist a set Io(U) of size 0# je (l) and a se£ /(£/) of size at least n — n € , 
and a nonzero integer k(U) bounded by n° B ^ 1 ^ such that for any i & I, there are integers 
ku (U),io G Iq(U), all bounded by n° B -^ , such that 

P y {\{k(U) Bi (A u )+Y i k Uo (U) aio (A u ),y)\<f3n°^) = n-°^ 1 \ 

where y = (y%, . . . , y n ) and yi are iid copies of £ — 

Note that Lemma [6.21 holds for all XJ C [n]. We will establish a similar result for the original 
matrix A. 

As I (U) C [n]° B . £ W and fe(C7) < n, there are only n 03 ^ 1 ) possibilities that (I (U),k(U)) 
can take. Thus there exists a tuple (Jo, k) such that Io(U) = Iq and fc(£7) = k for 2 n /n° s ' £ ( 1 ) 
different sets U. Let us denote this set of U by Thus 

|W| > 2 n /n° s ' e(1) - 

Next, let I be the collection of i which belong to at least |W|/2 index sets Ijj- Then we have 



+ (n- \I\)\U\/2 > {n-n e )\U\ 
\I\ > n- 2n e . 

Fix an i G I. Consider the tuples (ku (U),io G Jo) where i G If/. Because there are only 
n Os, e (l) possibilities such tuples can take, there must be a tuple, say (ku ,io G io), such 
that (k Uo (U),i G I ) = fe ,i G J ) for at least |W|/2n° fl .^ 1 ) = 2 n /n° s ' £(1) sets £7. 

Because |/o| = 0B,e(l), it is easy to see that there is a way to partition Io into I U Iq such 
that there are 2 n /n° B ^ l "> sets U above satisfying that Iq C £7 and £7 n Iq = 0. Let W/^' 
denote the collection of these £7. 
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By passing to consider a subset of Up i» if needed, we may assume that either i ^ U or 
i G U for all ?7 G Up p>. Without loss of generality, we assume the first case that i G U. 
(The other case can be treated similarly). 

Let U G Up p^ and u = (it l3 . . . , u n ) be its characteristic vector (uj = 1 if j £ U, and Uj = 
otherwise). 

By the definition of Ajj, and because I' Q D ?7 = and Iq C {/, for i G Iq and i ' G Iq we can 
write 



( a i(,(^c/) ; y) = ^2 a i' j u jyj, and ( a ^'(^c/),y> = XAoit 1 ~ n i)%- 

i=i j=i 
Also, because i ^ U, we have 



Thus, 



(fcai(Ai/),y)+ 5^(*ji a<o(^i/),y) 



ioeio 



(Aa<(Ai/),y) + ( ^ k^a^Au)^) + { ^ fc^a^A^y) 



t t-l J tJ 



= ka ijUjyj + h "'.,"■:,!",":, + EE ^'^'iC 1 ~ 

i=i i=i i' a' 3=1 i'^in 

n n 

= ^2(k aij + Y k u' o a i' j ~ Yl k ii'i a i'ii) u jyi + EE k i% a i'£iVr 
Next, by Lemma 16.21 for each U G Up p' we have 

P y (|(fcai(^),y)+ J2(k iioaio (Au),y)\ =0((3n°e-M)) = „- ^«. 
Also, note that 

\U P j» I = 2 n /n° s ' e(1) . 



Hence, 
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E y E [/ (fc(a i (A c/ ),y) + £ {k iio ^ {Au), y> = 0(/3n° s ' e(1) )) > rr°^« 
By applying the Cauchy-Schwarz inequality, we obtain 



n 



-ObA 1 ) 



< 



E y E C7 (A:(a J (yl L r),y) + (A^a^^y) = 0(/3n° B ^)) 



ioG-fo 



< E, 



E, 



ioG-fo 



E u (^(feaij + ^ k ii' a i' j ~ ^2 fe < a ^'j')%% + X] X] fc < a *o'i^' = °(/ 3n ° S ' £(1) )) 
i=i i&e/fi tfeJ? i=H'el£ 



< E y E U)U ,(^(% + ^ fc^a^ - ^ ^a )K - ^j)Vj = 0{fin° B ^)) 

3=1 iJeJj 



j"pT" 



i"pT" 



where := (uj — u'j)yj, and in the last inequality we used the simple observation that 



Eu,u'(/(u) = 0(Pn°»*W), f(u') = 0(fin ^)) < E u , u ,(/(u) - f(u') = 0(fin ^)). 



Note that uj — u'j are iid copies of r/ 1 / 2 ). Hence zj are iid copies of r/ 1//2 )(<!; — £'), where 
7/(V2) is independent of £ and 

Clearly, we may assume that J D Jo = by throwing away those i £ I that belong to Jo. 
We introduce the following matrix R. 



Definition 6.3. 



{R)n ■= < 



kij , if j € Jq ; 

— kij if j € Iq ; 

k, ifj = i; 

0, otherwise 



(17) 



for each i £ J; the other entries of R are zero, except the diagonal terms (R)u, where i ^ J, 
are defined to be one. 



It is clear that the matrix A' = RA satisfies the conclusion of Theorem 13.31 completing the 
proof. 
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7. proof of Theorem 12.51 

We first apply Theorem 13.31 to a« to obtain a matrix A' and a set |/| > n — 2n e such that 
for any i £ I, 

P,(|(z,ri(i4')}| < n- A+ ° B ^) > rT° B ^\ (18) 

Ideally, our next step is to apply Theorem 13.11 to the Ti(A'). However, the application is 
meaningful only when ||rj(j4')|| is relatively large. Investigating the degenerate case is our 
next goal. 

Set 

K = n- A l\ 

We consider two cases. 

Case 1. {degenerate case) ||rj(j4')|| < K for all i 6 I. Hence, 

||*Ti(A) + % r i(^)ll = 11^(^)11 < K. (19) 

Next, because Ylj ll c j(-^)ll 2 = 1) there exists an index jo such that ||c JO (yl)|| > n -1 / 2 . 
Consider this column vector. 

It follows from (|19p that for any i E I, 

\kcj (i) + kij c jo 

U)\<K. 

The above inequality means that the components Cj (i) of Cj Q (A) belong to a GAP generated 
by Cj (j)/k,j G Iq , up to an error K. This suggests us the following approximation. 

For each j ^ /, we approximate Cj (j) by a number Vj of the form (l/[2Er _1 J) • Z such that 
\ v j ~ c io(i)l — We nex t se ^ 

Vi \ = ^ ^ kij Vj I k 
ieib 

for any i & I. 

Thus, Vi belongs to a GAP of rank Ob i<e (1) and size n° B ^^ for all i € I. 
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With v = (yi,..., v n -i), we have 

||v-c i0 (A)|| <Kn° B ' e(1) . 
Furthermore, by Condition [21 and because (cj (^4), rj(M„_i)) = for i ^ jo, we infer that 

|(v,rj(M n _i))| < Krf 3 '^. 
Note that ||v|| 3> re -1 / 2 . Set u := [1/II V IIJ ' v > we then obtain 

• |(u,rj(M n _i))| < n -^/2+o s , £ (i) f or n _ 2 rows of M n _ x . 

• There exists a GAP of rank Ob i£ (1) and size n° B ' e ^ that contains at least n — 2n e 
components Uj. 

• All the components Ui, and all the generators of the GAP are rational numbers of 
the form p/q, where \p\, \q\ < n A / 2+ ° B - e ^' . 

Case 2. (non- degenerate case). There exists io G I such that ||rj (A')|| > K. Because 
r io (A') = kr io (A) + ^je/ k ioj r j( A )i r i ( A ') is orthogonal to n - \I \ - 1 = n - Ob, £ (1) 
column vectors of M n _i. Consequently, because M n _i is symmetric, rj (A') is orthogonal 
to n — Ob )£ (1) row vectors of M„_i. 

Set 

v:=r i0 (A')/||r l0 (A')||. 

Hence, (v, r,(M n _i)) = for at least n — Ob i£ (1) row vectors of M n _i. 
Also, it follows from (fTSj) that 

P z (|(z, v)| < n -^/2+o s , £ (i)) > n -Ofl,«(l)_ (20) 

Next, because the Zj satisfy Condition [H Theorem 13.11 applying to ()20p implies that v can 
be approximated by a vector u as follows. 

• \ui-Vi\< n~ A/2+ ° B ^ for all i. 

• There exists a GAP of rank (9b i£ (1) and size n° B ' c ^ that contains at least n — n e 
components U{. 

• All the components Ui, and all the generators of the GAP are rational numbers of 
the form p/q, where \p\, \q\ < n A ' 2+ ° B ' e ^' . 
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Note that, by the approximation above, we have ||u|| x 1 and |(u, rj(M n _i))| < n a/^+ObA 1 ) 
for at least n — (3b j(E (1) row vectors of M n _i. 



8. Proof of Theorem 12.61 

We first bound the number N of vectors u satisfying the conclusion of Theorem 12.61 

Because each GAP is determined by its generators and dimensions, the number of Qs is 
bounded by (n A+0s ' e ( 1 )) 0s . e ( 1 )(n 0s ' e(1) ) 0s - e(1) = n ° A ' B -^ 1 \ 

Next, for a given Q of rank Ob i£ (1) and size n° B < €(1 \ there are at most n n ~ 2ne \Q\ n ~ 2nt = 
n o s , e (n) wa y S ^ choose the n — 2n e components Ui that Q contains. 

The remaining components belong to the set {p/q, \p\, \ q\ < n A ^ 2+ ° B ' e ^}, so there are at 
most ^ n A +°B^(i)^2n c _ n o A ,B,d nC ) ways to choose them. 

Hence, we obtain the key bound 



N < n °A,B A 1 ) n °B ,A n ) n °A,BA nt ) = n OsA n ) _ (21) 

Therefore, for our task of proving Theorem 12.61 it would be ideal if we can show that the 
probability P^ (u) that |(u, rj(M„_i))| < /3q for n-Og, e (l) rows of M n _i are smaller than 
exp(— fi(n))/iV for each u, where {3q := n~ A ^ 2+ ° B ^^ is the bound from the conclusion of 
Theorem 12.51 



Roughly speaking, our strategy is to classify u into two classes: one contains of u of very 
small P^ (u), and thus their contribution is negligible; the other contains of u of relatively 
large P^ (u). To deal with those u of the second type, we will not control P/3 ( u ) directly 
but pass to a class of new vectors u' that are also almost orthogonal to many rows of M n _i , 
while the probability Yl^'Poi 11 ') are relatively smaller than ^■f > /3o( u )- More details follow. 

8.1. Technical reductions and key observations. By paying a factor of n° B > E W in 
probability, we may assume that |(u,rj(M n _i))| < /3q for the first n-Oe ]£ (l) rows of M n _i 
(because M n _i is symmetric, this is the case of most correlations among the entries). Also, 
by paying another factor of n n in probability, we may assume that the first no components 
Ui of u belong to a GAP Q, and u no > l/2\/n — 1, where no ■= n — 2n e . We refer the 
remaining Uj as exceptional components. Note that these extra factors do not affect our 
final bound exp(— f2(n)). 

For given (3 > and i < no, we define 



py(u) := supP Xit ... >Xno (\xiUi H h x no u no - a\ < /3), 

a 

where x«, . . . , x nQ are iid copies of £. 
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A crucial observation is that, by exposing the rows of M n _i one by one, and due to symme- 
try, the probability P J g(u) that |(u, rj(M n _i))| < j3 for alH < n — Ob can be bounded 
by 



P/3(u) < Yl supP Xi! ... tXn _ 1 (\x i u i -\ h x n -iu n -i - a\ < /3) 

l<i<n-O fl , e (l) a 

< ] j supP Xiv .. jXno (|xiUi H h x no n no - a| < j$) 

l<i<no a 

= n ppw- ( 22 ) 

l<fe<no 



Also, because of Condition [T] and u no > — 1, for any ft < c\/2yjn — 1 we have 

p« fc) (u) < sup Pa, (\x no u no -a\<f3) 

a 

< l-c 3 , (23) 

and thus, 



P^U) <(1-C 3 r = (l-C 3 



,(l-o(l))n 



Next, let C be a sufficiently large constant depending on B and e. We classify u into two 
classes B and depending on whether P^ (u) > n~ Cn or not. 



Because of (|30j) . and C is large enough, 



Pft( u ) < n° s ' t(n) /™ C ™ < n" n/2 . (24) 

For the rest of the section, we focus on u £ ,6. Depending on the distribution of the 
sequences (p^(u)), we consider two cases. 

8.2. Approximation for degenerate vectors. Let B\ be the collection of u S B sat- 
isfying the following property: for any n' = n 1 ~ e components , . . . , Ui , among the 
u\ , . . . , u no , we have 

supP* Xi / (|« il x <1 + --- + u iftl x <B , -a\ <n~ B - 4 ) > (n')- 1/2+o(1) . (25) 

a n 

For consision we set (3 = n~ B ~ A . It follows from Theorem 13.11 that . among any m x , . . . , U{ n , , 
there are, say, at least n'/2+ 1 components that belong to an interval of length 2/3 (because 



2-1 



HOI H. NGUYEN 



our GAP now has only one element). A simple argument then implies that there is an 
interval of length 2/3 that contains all but n' — 1 components U{. (To prove this, arrange 
the components in increasing order, then all but perhaps the first n'/2 and the last n'/2 
components will belong to an interval of length 2/3). 

Thus there exists a vector u' £ (2/3) • Z satisfying the following conditions. 

• \ui — u'^ < 2/3 for all i. 

• u'a = u for at least uq — n' indices i. 

Because of the approximation, whenever |(u, r-j(M n _i))| < /3q, we have 



\(u',r t (M n ^))\ < n B+2 (2p) + /3 = 0. 
It is clear, from the bound of /3 and /3q, that /3' < C2/2y/n — 1, and thus by ([23]) . 



P^(u') < (l-c 3 ) 



(l-o(l))n 



Now we bound the number of u' obtained from the approximation. First, there are 
{n n - n v+ n ') = 0(n 2n " £ ) ways to choose those v! i that take the same value u, and there 
are just 0(/3 _1 ) ways to choose u. The remaining components belong to the set (2/3) -1 • Z, 
and thus there are at most 0((/3~ 1 ) n ~ n ° +n ') = 0(n° A - B '^ n E )) ways to choose them. 

Hence we obtain the total bound 



J^P^(u') < 0(n 2nl ~ E )0(n°^( nl " E ))(l -c 3 ) (1 -° (1))n 

u' 

< (1 -c 3 )^'°^ n . 

8.3. Approximation for non-degenerate vectors. Assume that u G £>2 := B\B\. By 
exposing the rows of M n _i accordingly, and by paying an extra factor ( n ?) = 0(n n E ) in 
probability, we may assume that the components u no _ n / + i, . . . ,u no satisfy the property 



SUpPx no _ n , +1 ,...,x no (|Wn -n'+l X no-n'+l H ^ U nQ X no - d\ < V u ' ) < ill' 



~ B ~ A ) < (n'r 1/2+o{l) 

< n l/2- £ /2 +0 (l)_ (26) 

Next, define the following sequence /3k, k > 0, 



p k+1 := (2n B+2 + l)P k . 
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l<i<no—n' 

Roughly speaking, the reason we truncated the product here is that whenever i < riQ — n 1 ^ 6 , 
and /3 k is small enough, the terms pty(u) are smaller than (n')~ l l 2+olyl \ owing to (|26p . This 
fact will allow us to gain some significant factors when applying Theorem 13,11 



Note that 7rg fc (u) increases with k, and recall that vro(u) > n Cn . Thus, by the pigeon-hole 
principle, there exists fco := ko(u) < Ce" 1 such that 



It is crucial to note that, since A was chosen to be sufficiently large compared to Ob, £ (1) 
and C, we have 



Pka+l < n- B -\ 

Having mentioned the upper bound of pty(u), we now turn to its lower bound. Because of 
Condition [21 and m < 1, the following trivial bound holds for any /3 > /3q and i < n$ — n', 



We next divide the interval I = [ n -^/2+o B , e (i) j „i/2-e/2- (i)] into K = ( A / 2 + Bje (l))e- 1 
sub-intervals I k = [ n - A / 2 +o s , E (i)+fee ) n -A/2+o s , £ (i)+(fc+i)ej > p or s j 10 rt ) we denote by p k the 
left endpoint of each I k . Thus p k = n ~ A /2+o B , E (i)+fce_ 

With all the necessary settings above, we now classify u basing on the distributions of the 
Pfil (u), 1 < % < n - n 1_e . 

For each < ko < Ce~ l and each tuple (mo, . . . , ttik) satisfying mo + • • • + ttik = 
n 1-6 , we lei 
conditions. 

• fc (u) = k 



n 1 e , we let B^°'-' mK) denote the collect ion of those u from B2 that satisfy the following 



• There are exactly m k terms of the sequence {pp k ( u )) belonging to the interval I k . 
In other words, if mo + • • • + m k -\ + 1 < % < mo + • • • + m k then p^ (u) G I k . 

Pk 

Now we will use Theorem 13.11 to approximate u 6 ,m as follows. 
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• First step. Consider each index % in the range 1 < i < mo- Because p R G Iq, 
we apply Theorem 13.11 to approximate Ui by v! { such that |itj — u[\ < /3, and the 
u- belong to a GAP Q of rank Ob, € (1) and size O(p 1 /n 1 / 2 ~ e ) for all but n 1_2e 
indices i. Furthermore, all have the form (3^ -p/q, where |p| , |g| = 0(nf3^) = 
0(n A/2+0 B , e (l)y 

• k-th step, 1 < k < K. We focus on i from the range no + • • • + n£_i + 1 < i < 

tiq + ■ ■■ + rifc. Because p^ 0+ ' +nfc - 1+1 ) g j fc we apply Theorem 13. II to approximate 

pfc 

Uj by v! i such that |itj — u^j < /3fc and iij belongs to a GAP of rank Os )£ (l) and 
size 0(p'^ 1 /n l l 2 ~ t ) for all but n 1_2<E indices i. Furthermore, all u[ have the form 
Pko-p/q, where |g| = O^" 1 ) = 0(n A / 2 +° B ^). 

• For the remaining components m, we just simply approximate them by the closest 
point in /3j • Z. 

We have thus provided an approximation of u by u' satisfying the following properties. 

(1) \m - < f3k for all i. 

(2) u'i G Qk for all but n 1_2e indices i in the range mo+- • -+mfc_i + l < i < mo+- • -+mfc. 

(3) All the u[, including the generators of Qk, belong to the set /3k ' {p/li \p\Al\ ^ 
n A/2+0 B , e (i) \ 

(4) Q k has rank B)£ (1) and size = 0(p^ 1 /n l/2 ~ e ). 

Let / j^ ll '"'' mif ) De the collection of all u' obtained from u G g(^ ll >'"> mK ) as above. Observe 
that, as |(u, rj(M n _i))| < /3q for all? < n — Ob j£ (1), we have 



Ku'^iCM^i))! < (n B+2 + l)/3 i0 . (28) 



Hence, in order to justify Theorem 12.61 in the case u S 02, it suffices to show that the 

fe 



probability that (gSD holds for all i < n - B , e (l), for some u' G B'^ 1 ''"'™*', is small. 



Consider a u' G j^gj* 1 ""'™*) an d the probability P u / that J2SJI holds for aU i < n- O b , £ (1) 
We have 



P U ' < 11 supP Xi: ... :Xno (\u' l x l + --- + u' n _ 1 x no -a\ < (n B + 2 + l)/3 i0 ) 

l<i<no— n 1— e 

< J][ supP a!i) ... )a;no (|ui2;j H hn n _ix no - o| < (2n B+2 + l)/3 io ; 

a 



^ fcn+1 (u) < n en 7r^ (u) 
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where in the last inequality we used ([27)) . 

We recall from the definition of gfa* 1,, "' mjr ) that 

K K 



^ ( u )^n^i=- e(mi+ - +mfc) n^ fc 

k=l k=l 



k=i 



Hence, 



A' 



P U ' <n 2en Hp™K (29) 



k=l 



In the next step we bound the size of B'^ 1 '"' ' mK \ 



Because each is determined by its Os j(; (l) generators from the set /3j • {p/q, \p\, \q\ < 
n A/2+o s , £ (i)|^ an( j j|. g dimensions from the integers bounded by rP 3 '^, there are n° A < B ^^ 
ways to choose each Q^. So the total number of ways to choose Q\, . . . , Qk is bounded by 



( n OA,s, e (l))A = n O A , B , e (l)_ 
Next, after locating Q^, the number iVi of ways to choose uZ from each Q k is 



k\ 



k=i v 7 

K 

< 2 m i + - +m '< \Q 

k=l 

<(0(l)) n UPk mk /n {1 
k=i 

K 

< II Pk" 1 ' /n (1 / 2 - e -°^ n 



K 

k I 

k=l 

K 

/2-e)(mi+-+m fc ) 



k=l 



where we used the bound \Q k \ = O^ 1 /n 1 / 2 ^). 

The remaining components u'a can take any value from the set Pi -{p/q, \p\, \q\ < n A l 2+ ° B >^ 1 )}, 
so the number N2 of ways to choose them is bounded by 
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Putting the bound for JVi and A^ together, we obtain a bound A" for | j g / ^ ll '—' w - K ')| 



K 

N <HPk mk M 1/2 - e -° {1))n . (30) 
fc=i 

It follows from ([22]) and ([30]) that 



£ Pu' < n 2m J] # * II /^/n^— °M)n < n ~(V2-3e-o(l))n (31) 

u / 6 m! m i.'-. m if) fc=l fc=l 

fc 

Summing over the choices of ko and (mi, . . . , mx) we obtain the bound 



^ ^ Pu' < n-( 1 /2-3,-o(l))n ! 



fco,mi 



u'ee 



,("1.- ,m_ff) 



completing the proof of Theorem 12.61 



9. Proof of Corollary II .81 



Assume that the upper diagonal entries of M n satisfy the conditions of Corollary 11.81 We 
denote by Ai < A2 < • • • < A n the real eigenvalues of M n . 

Our first ingredient is the following special form of the spectral concentration result of 
Guionnet and Zeitouni. 

Lemma 9.1. |10t Theorem 1.1] Assume that f is a convex Lipschitz function. Then for 
any 6 > 8 := 16C^\f\ L /n, 



J2f(X i )--E(J2f(X i ))\>Sn) <4exp(- 



n 2 {5-8 f 
' 16C 2 |/|| 



Following [4J and [9], we will apply the above theorem to the cut-off functions f*(x) := 
log(max(e, x)) and f^{x) = log(max(e, — x)), for some e > to be determined. The main 
reason we have to truncate the log function is because it is not Lipschitz. Note that / + 
and / _1 both have Lipschitz constant e . Although they are not convex, it is easy to 
write them as difference of convex functions of Lipschitz constant 0(e _1 ), and so Lemma 
19.11 applies. Thus the followings hold for 5 3> (en) -1 
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P|| ^ logAi-E( ^ logAOl > 5n ) < exp(-8(nW)) 



and 



P | log |Ai| - E( ^ log |Ai|)| > 5n ) < exp(-0(nW)), 

where 5+ := {Aj, Aj > e} and S~ := {Aj, Aj < — e}. 
Hence, 



P | log|Ai|-E( Y log|Ai|)|>25n <exp(-9(n 2 5 2 e 2 )). (32) 

\ Xiesrust \i£srus+ j 

Roughly speaking, (j32|) implies that I1a esrus + 1^*1 ^ s weu concentrated around its mean. 
It thus remains to control the factor R := n|A;|<e 1^1 • We will bound R away from zero, 
relying on Theorem 11.51 and Lemma 19.21 below. 

Lemma 9.2. [31| Proposition 66], |5, Theorem 5.1] Assume that M n is a random symmetric 
matrix of entries satisfying the conditions of Corollary \1.8l Then for all I C R with 
\I\ > K 2 log 2 n/n 1 / 2 , one has 

with probability 1 — exp(a;(log n)), where Nj is the number of Aj belonging to I. 

By Lemma [9. 2\ we have \{i, |Aj| < e}| <C n 1 / 2 e. Also, Theorem 1 1 . 51 implies that minj{|Aj|} > 
n~ A with probability 1 — 0(n~ B ). Thus 

R= TT |Ai| > (minllA,!})™ 17 ^ = n~ ^\ (33) 
|A»|<e 

Our next goal is the following result. 

Proposition 9.3. With probability 1 — n u W we have 



[] |A,|=exp(-0(e- 2 ))E( ]J | A,|) - exp(^^). (34) 

\iesrus+ x t esrus+ 



Let us complete the proof of Corollary 11.81 assuming Proposition 19.31 
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Firstly, because n Ai6Sru5 + M > Ui=i \MI^ St U5e+I > 117=1 |A 4 | = | det(M„)|, it follows 
from Proposition 19.31 that with probability 1 — n u W, 

[] | Ai| = exp(-0(e- 2 ))E(| det(M n )|) - exp(^^). (35) 

\i£srus+ 

Secondly, by ([33]), the following holds with probability 1 — 0(n~~ B ) 

|det(M n )| = H |Ai| J] \h\ > n-°^ 1/2 ^ 11 \Xi\. 
\i$srus+ \i€srus+ Xiesrus+ 

Combining with (|35p . we have 

| det(M n )| = exp(-0(e- 2 + en 1 / 2 logn))E(| det(M n |) - n-° (nl/2e) exp(^^). 

By choosing e = n -1 / 6 , we obtain the conclusion of Corollarv ll.8( noting that E(| det(M n )|) S> 
exp(n). 

It remains to prove Proposition 19.31 

Proof, (of Proposition I9.3p Set 5 := ^fp, which satisfies the condition of Lemma 19. II as n 
is sufficiently large. By (f32|) we have 

P(| Yl lo gl^!- E ( E log|A,|)|>^^)<exp(-e(log 2 n)) 

\i&srus+ \iesrus+ 

P(| J] |A,|-exp(E( Yl log|A,|)|)>exp(^^))<exp(-e(log 2 n)). (36) 

\i£srus+ \i£srus+ 

Set 

U:= Yl lo gl A *|- E ( E lo §l Ai l)- 

XieSruSt \iesrus+ 

We have E(C7) = 0. Thus, by Jensen inequality and by ([32]) . 

1 < E(exp(C/)) < E(exp(|C/|)) < 1 + j|exp(i)P(|i7| >t)= exp(0( e - 2 )). (37) 
Observe that 
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E(exp([7))=E( H |Ai|)/exp(E( £ log|A 4 |)). 



It thus follows from (1371) that 



exp(E( Yl log|A l |)) = exp(-0( e - 2 ))E( ]J |A,|). 
This relation, together with (|36p . imply that with probability 1 — n^ l \ 

[] |Ai| = exp(-0( e - 2 ))E( J] |Ai|) - exp(^^ 



□ 



Appendix A. Proof of Lemma 12721 

Assume that vi , . . . , G R n are independent vectors that span H . Also, without loss of 
generality, we assume that the subvectors (uu, . . . , v\k), ■ ■ ■ , (vki, • • • , v^k) generate a full 
space of dimension k. 

Consider a random vector u = (/i + x%, . . . , f n + x n ), where iid copies of £. 

If u £ H, then there exist ai, . . . , otk such that 



U = ^ CtiVi 



i=l 



Note that ai, . . . ,ak are uniquely determined once the first k components of u are exposed. 
Thus we have 



k 

p(ueH)< ii P;Cj (x j + / J = ^a i% )<(v / r^r- fe , 

k+l<j i=l 



where in the last estimate we use the fact (which follows from Condition [T]) that sup a P(£ = 
a) < V 1 - C3- 
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Appendix B. Proof of Lemma 16.11 



The goal of this section is to establish the inequality 

P ViVV (\J2Au(ij)v lWj \ = B>e (P^/]^n)) > p 8 /((2vr) 7/2 exp(87r)), 

hi 

under the assumption that 

supP x (| y]aij(xi + fi)(xj + fj) -a\< (3) = p> 



n~ B . 



Set 



a 'ij aij/P- 



We have 



supP x (| ^a'ijixi + fi)(xj + fj) - a'\ < 1) = sup P x (| ^ a^Xi + fi){xj + fj) - a\ < 1) 



a m hj 



We next write 

P x (| J^a'^Xi + fi)(xj + fj) - a'\ < 1) = P(exp(-|(^a^.(x i + f^Xj + f 3 ) - a') 2 > exp(- 

< exp(|)E x exp(-^(^a^(x i + f^Xj + /,) - a') 2 ). 

hi 

Note that exp(-fx 2 ) = J R e(xt) exp(-f t 2 )dt. Thus 



Px(|^4(^+/i)(^+/i)-a'| < 1) < exp(|) / \E^e[(Y,a' ij (x i +f l )(xj+fj))t]\exp(-^t 2 )dt 
< exp(|)v / 2^^ |E x e[(^a^(^ + fi)(xj + /,))t]|(exp(-^i 2 )/V2^. 



Consider x as (x(/,Xfj), where X[/,xp are the vectors corresponding to i G U and i U 
respectively. By the Cauchy-Schwarz inequality we have 



ON THE LEAST SINGULAR VALUE OF RANDOM SYMMETRIC MATRICES 



33 



< 



< 



+ 



|E x e((^ a'^Xi + fi)(xj + fj))t)\ 2 (expi—t 2 ) /y/^)dt 
E X[7 |E x -e((^ a'^ + h){x 3 + f 3 ))t)\ 2 (eM~t 2 )/V^)dt 
/ E X(,E X x 



i<=U,j£U 



< / E, 



R 



ieujeti 



+ 



£ + + /,) - {x\ + + fj)))t) 2 (exp(-|t 2 )/v / 2^)dt 

ieU,j€U 

I E XUjXi7iXo y.e(( ^ a^.(x i -xO(x j -x;.))t)(exp(-^t 2 )/v / 2^)^ 

.-& e (( E ao-J/i^)«)(exp(-|t 2 )/V^F)rft, 



ieujeu 

where yj/ = x;y — and = xp- — x'^, whose entries are iid copies of £ — 
Thus we have 



f |E x e((^a^^)t)|(exp(-^ 2 )/v^)^l 8 < [ f E yy , z -e(( £ a^)t) (exp(-^ 2 )/v^)eft 



< 



/ E yUtZ .y ujZ ,_e{( 4jVi*j- £ a^.^4)t)(exp(-^ 2 )/v^)^ 



i&U,j€U 



ieujeu 



Because a' :4 = a'-,-, we can write the last term as 

1 j j l 



j Ey^.^-e^ £ a Jl (-4)^)t)(exp(-|t 2 )/v^)dt 
= ^E VjW e(( o- J -u i «;j+ ^ a^ViW^t) (exp(-|t 2 )/v / 2^)^, 



idU,j&U 



ieu,jeu 
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where v := (y v , -z'^) and w := (y^, z ). 
Next, we observe that 



/ E VjW e(( ^2 a'ijViWj+ ^ a'^ViW^tj (exp(-^t 2 )/v / 2vr)(it 
R ieujeu ieujeu 

= (l/v^F)Ev, w exp(-|(^ ^(z?)^) 2 ), 

where 



= Au(ij)/p. 

Thus 



p 8 = (P x (|^a^,x, -a'| < 1)) < exp(4vr)(2vr) 4 ( / |E x e((^ a^x l x j )t)|(exp(-|t 2 )/V 
»,j R »,j 

< exp(47r)(2vr) 7 / 2 E ViW exp(-|(^^(ij) / ^^) 2 )- 

i,j 

Because p > n~ B , the inequality above implies that 

P v , w (| Y^Au{ij)'viWj\ = Ob^v 7 !^)) > P 8 /((2^) T/2 exp(47r)). 

Scaling back to Aij, we obtain 

P^wd^^^iK^I = B , e (Py/tegn)) > p 8 /((2vr) 7/2 exp(47r)), 
hi 

completing the proof. 

Acknowledgements. The author would like to thank R. Pemantle and V. Vu for very 
helpful discussions and comments. 

References 

[1] A. Barvinok, Polynomial time algorithms to approximate permanents and mixed discriminants within 

a simply exponential factor, Random Structures Algorithms 14 (1999), 29-61. 
[2] B. Bollobas, Random Graphs, Academic Press, New York. 

[3] K. Costello, T. Tao and V. Vu, Random symmetric matrices are almost surely non-singular, Duke Math. 
J. 135 (2006), 395-413. 



ON THE LEAST SINGULAR VALUE OF RANDOM SYMMETRIC MATRICES 



35 



[4] K. P. Costello and V. Vu, Concentration of random determinants and permanent estimators, Siam J. 

Discrete Math 23 (2009), 1356-1371. 
[5] L. Erdos, B. Schlein and H-T. Yau, Wegner estimate and level repulsion for Wigner random matrices, 

Int. Math. Res. Notices, 3 (2010), 436-479. 
[6] P. Erdos, On a lemma of Littlewood and Offord, Bull. Amer. Math. Soc. 51 (1945), 898-902. 
[7] P. Erdos and L. Moser, Elementary Problems and Solutions: Solutions: E736. Amer. Math. Monthly, 

54 (1947), no. 4, 229-230. 

[8] C. G. Esseen, On the Kolmogorov-Rogozm inequality for the concentration function, Z. Wahrsch. Verw. 
Gebiete 5 (1966), 210-216. 

[9] S. Friedland, B. Rider, and O. Zeitouni, Concentration of permanent estimators for certain large ma- 
trices, Ann. Appl. Probab., 14 (2004) 1559-1576. 
[10] A. Guionnet and O. Zeitouni, Concentration of the spectral measure for large matrices, Electron. Comm. 
Probab. 5 (2000) 119-136. 

[11] G. Halasz, Estimates for the concentration function of combinatorial number theory and probability, 

Period. Math. Hungar. 8 (1977), no. 3-4, 197-211. 
[12] M. Kanter, Probability inequalities for convex sets, J. Multivariate Anal, 6 (1976), 222-236. 
[13] G. Katona, On a conjecture of Erdos and a stronger form of Sperner's theorem. Studia Sci. Math. 

Hungar 1 (1966), 59-63. 

[14] D. Kleitman, On a lemma of Littlewood and Offord on the distributions of linear combinations of vectors, 

Advances in Math. 5 (1970), 155-157. 
[15] A. Kolmogorov, Two uniform limit theorems for sums of independent random variables, Theor. Probab. 

Appl. 1 (1956), 384-394. 

[16] J. E. Littlewood and A. C. Offord, On the number of real roots of a random algebraic equation. III. Rec. 

Math. Mat. Sbornik N.S. 12 , (1943). 277-286. 
[17] H. Nguyen, Inverse Littlewood- Offord problems and the singularity of random symmetric matrices, 

http : //arxiv . org/abs/1 10 1 . 3074 1 submitted. 
[18] H. Nguyen and V. Vu, Optimal Littlewood- Offord theorems, to appear in Advances in Mathematics. 
[19] A. Odlyzko, On subspaces spanned by random selections of ±1 vectors, J. Combin. Theory Ser. A 47 

(1988), no. 1, 124-133. 

[20] B. A. Rogozin, An estimate for concentration functions, Theor. Probab. Appl. 6 (1961), 94-97. 

[21] M. Rudelson and R. Vershynin, The Littlewood-Offord Problem and invertibility of random matrices, 

Advances in Mathematics 218 (2008), 600-633. 
[22] A. Sarkozy and E. Szemeredi, liber ein Problem von Erdos und Moser, Acta Arithmetica 11 (1965), 

205-208. 

[23] S. Smale, On the efficiency of algorithms of analysis, Bullentin of the AMS (13) (1985), 87-121. 

[24] D. A. Spielman and S. H. Teng, Smoothed analysis of algorithms: why the simplex algorithm usually 

takes polynomial time, J.ACM 51 (2004), no. 3, 385463. 
[25] R. Stanley, Weyl groups, the hard Lefschetz theorem, and the Sperner property, SIAM J. Algebraic 

Discrete Methods 1 (1980), no. 2, 168-184. 
[26] T. Tao and V. Vu, From the Littlewood-Offord problem to the circular law: universality of the spectral 

distribution of random matrices, Bull. Amer. Math. Soc. (N.S.) 46 (2009), no. 3, 377-396. 
[27] T. Tao and V. Vu, On ±1 matrices: singularity and determinant, Random Structures Algorithms 28 

(2006), 1-23. 

[28] T. Tao and V. Vu, Inverse Littlewood-Offord theorems and the condition number of random matrices, 

Annals of Mathematics (2) 169 (2009), no 2, 595-632. 
[29] T. Tao and V. Vu, On the singularity probability of random Bernoulli matrices, Journal of the A. M. S 

20 (2007), 603-673. 

[30] T. Tao and V. Vu, Random matrices: The Circular Law, Communication in Contemporary Mathematics 
10 (2008), 261-307. 

[31] T. Tao and V. Vu, Random matrices: universality of local eigenvalue statistics, to appear, Acta Math. 
[32] T. Tao and V. Vu, Smooth analysis of the condition number and the least singular value, (to appear in 

Mathematics of Computation). 
[33] T. Tao and V. Vu, Additive Combinatorics, Cambridge Univ. Press, 2006. 



36 



HOI H. NGUYEN 



Department of Mathematics, University of Pennsylvania, 209 South 33rd Street, Philadel- 
phia, PA 19104, USA 

E-mail address: hoing@math.upenn.edu 



